Full Circle: Simulating Linux Clusters on Linux Clusters
Abstract
BGLsim is a complete system simulator for parallel machines. It is currently being used in hardware validation and software development for the BlueGene/L cellular architecture machine. BGLsim is capable of functionally simulating multiple nodes of this machine operating in parallel. It simulates instruction execution in each node and the communication that happens between nodes. BGLsim allows us to develop, test, and run exactly the same code that will be used in the real system. Using BGLsim, we can gather data that helps us debug and enhance software (including parallel software) and evaluate hardware. To illustrate the capabilities of BGLsim, we describe experiments running the NAS Parallel Benchmark IS on a simulated BlueGene/L machine. BGLsim is a parallel application that runs on Linux clusters. It executes fast enough to run complete operating systems and complex MPI codes.
Similar resources
On the Implementation and Performance of the (α, t) Protocol on Linux
This paper details the design and implementation of the (α, t) protocol for ad-hoc networks on Linux. The (α, t) protocol utilizes adaptive clustering to organize nodes into clusters in which the probability of path failure due to node movement can be bounded over time. Based on the (α, t) cluster scheme, routes within clusters are maintained on a proactive basis while routing between clusters ...
Parallel Simulation of 3D Nonlinear Acoustic Fields on a Linux-cluster
Simulating the propagation of 3D ultrasonic waves in a nonlinear medium is a demanding task. It requires the solution of time-dependent and nonlinear partial differential equations (PDEs). For such nonlinear PDEs, we have to use an implicit numerical method that is very CPU-intensive. Parallel simulation is therefore essential for studying those ultrasonic waves with satisfactory accuracy. We pr...
Optimizing Performance on Linux Clusters Using Advanced Communication Protocols: How 10+ Teraflops Was Achieved on a 8.6 Teraflops Linpack-Rated Linux Cluster
Advancements in high-performance networks (Quadrics, InfiniBand or Myrinet) continue to improve the efficiency of modern clusters. However, the average application efficiency is only a small fraction of the system's peak efficiency. This paper describes techniques for optimizing application performance on Linux clusters using Remote Memory Access communication protocols. The effectiveness of...
Network Performance in High Performance Linux Clusters
Linux-based clusters have become more prevalent as a foundation for High Performance Computing (HPC) systems. With a better understanding of network performance in these environments, we can optimize configurations and develop better management and administration policies to improve operations. To assist in this process, we developed a network measurement tool to measure UDP, TCP and MPI commun...
PCIComm: A Linux Device Driver for Communication over PCI Shared Memory
Computing clusters are often comprised of a set of full computers connected by a network. Several technologies are emerging to more tightly couple the CPUs in clusters. This paper addresses a Linux device driver for an inter-node communication system. It describes a system utilizing commonplace existing hardware to perform I/O between embedded computers communicating over a shared PCI bus. The ...
Publication date: 2003